Performance Analysis of Parallelizing Program Code for Multi-Core Architectures Using OPENMP

Authors

  • Neha Agrawal
  • Ruchira Datta
  • M Rajasekhara Babu
  • Pradeep Reddy
Abstract

Multi-core architectures have become popular because they offer better performance, lower heat dissipation and power consumption, and more efficient simultaneous processing of multiple tasks. To make a single application run faster, the application must be divided into subprograms, or threads, that cooperate to deliver the desired functionality. The OpenMP programming paradigm implements loop-level parallelism, one of the most basic units of parallelism available to parallel OpenMP programs [3]. Loop-level parallelism allows an OpenMP implementation to split the work easily across multiple threads. This paper focuses on parallelizing program code using OpenMP and analyzes program performance using VTune.
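As a minimal sketch of the loop-level parallelism the abstract describes, the C fragment below splits the iterations of a single loop across threads with an OpenMP `parallel for` directive. The function name `dot_product` and its arguments are hypothetical and are not taken from the paper; the `reduction` clause is assumed here as one common way to combine per-thread partial sums safely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not from the paper): returns the sum of x[i] * y[i]
 * for i in [0, n). The loop iterations are independent, so OpenMP can
 * divide them among the available threads. */
double dot_product(const double *x, const double *y, int n)
{
    assert(x != NULL && y != NULL && n >= 0);

    double sum = 0.0;

    /* Each thread accumulates a private partial sum; the reduction clause
     * adds the partial sums into `sum` when the loop finishes. */
    #pragma omp parallel for reduction(+:sum)
    for (int i = 0; i < n; i++) {
        sum += x[i] * y[i];
    }

    return sum;
}
```

With GCC or Clang the directive takes effect when the file is compiled with `-fopenmp`; without that flag the pragma is ignored and the loop runs serially, which gives a convenient baseline when comparing timings in a profiler.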

Similar resources

Early experiences and results on parallelizing discrete dislocation dynamics simulations on multi-core architectures

Materials science simulations are among the leading applications for scientific supercomputing. Discrete dislocation dynamics (DDD) is a numerical tool used to model the plastic behavior of crystalline materials using the elastic theory of dislocations. DDD simulations require very long running times to produce meaningful scientific results. This work presents early experiences and results on i...

Experiences with Achieving Portability across Heterogeneous Architectures

The increasing computational needs of parallel applications inevitably require portability across popular parallel architectures, which are becoming heterogeneous. The lack of a common parallel framework results in divergent code bases, difficulty in porting, higher maintenance cost, and, thus difficulty achieving optimal performance on target architectures. Our paper examines two representativ...

Coarse-Grain Task Parallel Processing Using the OpenMP Backend of the OSCAR Multigrain Parallelizing Compiler

This paper describes automatic coarse-grain parallel processing on a shared-memory multiprocessor system using a newly developed OpenMP backend of the OSCAR multigrain parallelizing compiler, for systems ranging from a single-chip multiprocessor to a high-performance multiprocessor and a heterogeneous supercomputer cluster. The OSCAR multigrain parallelizing compiler exploits coarse-grain task parallelism and near fine gra...

Towards Autotuning of OpenMP Applications on Multicore Architectures

In this paper we describe an autotuning tool for optimization of OpenMP applications on highly multicore and multithreaded architectures. Our work was motivated by in-depth performance analysis of scientific applications and synthetic benchmarks on IBM Power 775 architecture. The tool provides an automatic code instrumentation of OpenMP parallel regions. Based on measurement of chosen hardware ...

Performance Analysis and Optimization of the OP2 Framework on Many-Core Architectures

This paper presents a benchmarking, performance analysis and optimization study of the OP2 ‘active’ library, which provides an abstraction framework for the parallel execution of unstructured mesh applications. OP2 aims to decouple the scientific specification of the application from its parallel implementation, and thereby achieve code longevity and near-optimal performance through re-targetin...

Publication date: 2012